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Haemophilus parasuis is a member of the family Pasteurellaceae and is the etiologic agent of 
Glasser's disease in pigs, a systemic syndrome associated with only a subset of isolates. The 
genetic basis for virulence and systemic spread of particular H. parasuis isolates is currently 
unknown. Strain 29755 is an invasive isolate that has long been used in the study of Glasser's 
disease. Accordingly, the genome sequence of strain 29755 is of considerable importance to 
investigators endeavoring to understand the molecular pathogenesis of H. parasuis. Here we 
describe the features of the 2,224,137 bp draft genome sequence of strain 29755 generated 
from 454-FLX pyrosequencing. These data comprise the first publicly available genome se- 
quence for this bacterium. 



Introduction 



Classification and features 



H. parasuis is an obligate pathogen of swine [1]. 
The bacterium is often carried in the nasal passag- 
es [2], but not the lungs [3], of healthy pigs. 
Through unknown mechanisms some strains can 
spread systemically and may be isolated from the 
meninges, lungs, serosa, joints, and blood. H. para- 
suis strain 29755 (IA84-29755), though not the 
type strain, has been used extensively in a variety 
of investigations [4-8] and is the most fully cha- 
racterized strain of the species. Originally cultured 
at Iowa State University from a pig exhibiting 
Glasser's disease, 29755 is a serovar 5 isolate [9], 
a class recognized as highly virulent and frequent- 
ly isolated from respiratory and systemic sites 
[9,10]. Of the 15 recognized serovars, serovar 5 
strains are isolated more frequently worldwide 
than any other [11]. Strain 29755 has been used 
as a component of at least one commercially avail- 
able H. parasuis vaccine (Suvaxyn M. hyo - para- 
suis, Fort Dodge Animal Health). 



The genus Haemophilus belongs to the Gammapro- 
teobacteria and is classified in the family Pasteu- 
rellaceae [12] (Table 1). A phylogenetic tree based 
on 16S ribosomal RNA sequences is depicted in 
Figure 1 for H. parasuis and related organisms. 



H. parasuis is a small, non-motile, rod-shaped bac- 
terium [1] (Figure 2). The presence of a capsule is 
variable and may affect colony and cellular mor- 
phology [30]. Growth of the bacterium in vitro is 
dependent on the coenzyme nicotinamide adenine 
dinucleotide (NAD, or V factor) [31] but, in con- 
trast to some other members of the genus, does 
not require porphyrins like hemin (X factor) [32]. 
Plating on Casman Agar Base (BBL) supplemented 
with 1% (w/v) NAD (Sigma) and 5% GIBCO fil- 
tered horse serum (Invitrogen) or on chocolate 
agar produces small, translucent colonies that ap- 
pear within 24 hours and reach full size in approx- 
imately two days. Colonies are nonhemolytic 
when grown on blood agar [1]. 
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Table 1. MIGS classification and general features of H. parasuis strain 29755. 
MIGS ID Property Term 



Evidence code 



MIGS-6 

MIGS-6.3 

MIGS-22 

MIGS-15 

MIGS-14 

MIGS-4 

MIGS-5 

MIGS-4.1 

MIGS-4.2 

MIGS-4.3 

MIGS-4.4 



Current classification 



Gram stain 

Cell shape 

Motility 

Sporulation 

Temperature range 

Optimum temperature 

Carbon source 

Energy source 

Terminal electron receptor 

Habitat 

Salinity 

Oxygen requirement 

Biotic relationship 

Pathogenicity 

Geographic location 

Sample collection time 

Latitude 

Longitude 

Depth 

Altitude 



Domain Bacteria TAS [13] 

Phylum Proteobacteria TAS [14] 

Class Cammaproteobacteria TAS [15,16] 

Order Pasteurellales TAS [15,17] 

Family Pasteurellaceae TAS [18,19] 

Genus Haemophilus TAS [20-22] 

Species Haemophilus parasuis TAS [20,23] 
Strain 29755 
Serotype 5 

negative TAS [1 ] 

rods (pleomorphic) TAS [1] 

nonmotile TAS [1] 

non-sporulating TAS [1] 

mesophile (20°C-37°C) TAS [12] 

35°C-37°C TAS [12] 

saccharolytic TAS [24] 

chemoorganotroph TAS [24] 

Oxygen TAS [25] 
Host, swine upper respiratory tract TAS [1] 

1-1.5% TAS [12] 

facultative TAS [12] 

obligate pathogen of swine TAS [1 ] 

mild to severe TAS [1] 

Iowa NAS 

1970s NAS 
not reported 
not reported 
not reported 
not reported 



Evidence codes - NAS: Non-traceable Author Statement (i.e., not directly observed for the living, iso- 
lated sample, but based on a generally accepted property for the species, or anecdotal evidence). 
These evidence codes are from of the Gene Ontology project [26] 
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Actinobacillus pleuropneumoniae (T); M75074 
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Escherichia coli (T); ATCC 1 1775T 



0.01 

Figure 1 . Phylogenetic tree based on 1 6S rRNA of H. parasuis 29755 and type strains of some closely related spe- 
cies and other genera within the Pasteurellaceae. Also included is the only additional H. parasuis strain for which 
a genome sequence has been reported, SHOT 65. The tree was generated with the tree-builder available from the 
Ribosomal Database Project[27] using the Weighbor (weighted neighbor-joining) algorithm [28] with Jukes- 
Cantor distance correction [29]. Numbers to the left of branches indicate the percentage of trees in which each 
branch was represented in 100 replicates. An E. coli type strain was used as an outgroup. 




Figure 2. Scanning electron micrograph of H. parasuis 29755 



H. parasuis grows under normal atmosphere at 
37°C, although added humidity and 5% CO2 may 
improve growth. 

Genome sequencing and annotation 

Genome project history 

H. parasuis strain 29755 was selected for sequencing 
because it has long been used in the study of 
Glasser's disease. Pyrosequencing (454 Life 



Sciences) was performed at the State University of 
New York, University at Buffalo Center of Excellence 
in Bioinformatics and Life Sciences. The draft ge- 
nome sequence is deposited in GenBank 
(NZ_ABKM00000000). Summary project informa- 
tion is shown in Table 2 according to the Minimum 
Information about a Genomic Sequence (MIGS) rec- 
ommendations [34] and the genome content is 
summarized in Table 3. 
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Growth conditions and DNA isolation 

H. parasuis 29755 was grown from a frozen seed 
stock for two days under 5% CO2 at 37°C on Cas- 
man Agar Base (BBL) supplemented with 1% (w/v) 
NAD (Sigma) and 5% GIBCO filtered horse serum 
(Invitrogen). Following growth, a single colony was 
used to inoculate 5 ml of brain-heart infusion me- 
dium supplemented with 10 |ig/ml NAD and 10 
|ig/ml hemin (sBHI) and the culture was incubated 
overnight at 37°C and 185 rpm. The next day, 2 ml 



of the culture were added to 100 ml of sBHI and the 
bacterium was again allowed to grow overnight to 
stationary phase at 37°C and 185 rpm. Bacterial 
cells were pelleted by centrifugation at 4000 * g for 
10 minutes. The pellet was resuspended and used 
as the source of genomic DNA purified with the 
QIAGEN Blood & Cell Culture DNA Kit, as recom- 
mended by the manufacturer. The final preparation 
contained 1.12 ug/ul genomic DNA as determined 
by UV absorption spectrometry. 



Table 2. Genome sequencing project information 



MIGS ID 


Property 


Term 


MIGS-28 


Libraries used 


one 454 pyrosequence standard library 


MIGS-29 


Sequencing platforms 


454 (FLX) 


MIGS-30 


Assemblers 


Newbler 


MIGS-31 


Finishing quality 


draft 


MIGS-31.2 


Fold coverage 


28x 


MIGS-32 


Gene calling method 


Glimmer, GeneMark [33] 




Genome Database release 


February 14, 2008 




Genbank ID 


NZ_ABKM00000000 




Genbank Date of Release 


February 14, 2008 




GOLD ID 






Project relevance 


food animal pathogenesis 



Table 3. Genome statistics 



Attribute 


Value 


% of total 3 


Size (bp) 


2,224,137 


100.0% 


G+C content (bp) 


867,413 


39.0% 


Coding region (bp) 


1,890,516 


85.0% 


Total genes 


2,309 


100.0% 


RNA genes 


58 


2.5% 


Protein-coding genes 


2,244 


97.2% 


Pseudogenes 


none b 


0.0% 


Genes in paralog clusters 


nd c 




Genes assigned to COGs 


1,926 


83.4% 


PSORT cytoplasmic 


1,181 


50.4% 


PSORT extracellular 


5 


0.2% 


PSORT outer membrane 


51 


2.2% 


PSORT periplasmic 


52 


2.2% 


PSORT unknown 


1,055 


45.0% 



a Based either on the size of the genome in base pairs or 
the total number of protein coding genes in the anno- 
tated genome 

''Based on preliminary analysis the of draft genome 
c nd = not determined 
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Genome sequencing and assembly 

Library preparation yielded 9.65 x 10 8 mole- 
cules/|il of DNA with a mean size of approximately 
600 nucleotides, as determined with a RNA6000 
Pico chip on an Agilent 2100 Bioanalyzer. Emul- 
sion PCR was performed at a concentration of 2 
molecules per bead. Following sequencing, contigs 
were assembled using the 454 Newbler assemb- 
ler. 

Genome annotation 

Genes were identified manually using GeneMark 
and automatically using Glimmer as part of the 
NCBI draft genome submission pipeline. Translated 



protein sequences were analyzed using PSORTb 
v.2.0 [35] to predict final location within the cell 
and assigned to COG functional categories (Table 
4). 

Genome properties 

The draft genome is 2,224,137 bp and is likely 
comprised of one circular chromosome with a G+C 
content of approximately 39% (Figure 3). For dis- 
play, contigs were assembled end-to-end with 
twenty "N" bases between contigs. Orientation 
and order of contigs will change when the genome 
sequence is closed. 




Figure 3. Graphical circular map of the H. parasuis 29755 draft pseudogenome. From the outside to the center: 
open reading frames (ORFs) on the forward strand (one ring for each reading frame), start and stop codons for 
forward and reverse strands, ORFs on the reverse strand, GC content, and GC skew. The map was generated 
using CGView Server [36,37]. 



66 



Standards in Genomic Sciences 



Mullins ef al. 



Table 4. Number of genes associated with the general COG functional categories 



Code 


Value 


%age a 


Description 


J 


168 


6.55 


Translation 


A 


1 


0.03 


RNA processing and modification 


K 


127 


4.96 


Transcription 


L 


166 


6.48 


Replication, recombination and repair 


B 


0 


0.00 


Chromatin structure and dynamics 


D 


33 


1.29 


Cell cycle control, mitosis and meiosis 


Y 


0 


0.00 


Nuclear structure 


V 


32 


1.25 


Defense mechanisms 


T 


48 


1.87 


Signal transduction mechanisms 


M 


134 


5.23 


Cell wall/membrane biogenesis 


N 


16 


0.62 


Cell motility 


Z 


0 


0.00 


Cytoskeleton 


w 


24 


0.94 


Extracellular structures 


u 


75 


2.93 


Intracellular trafficking and secretion 


o 


101 


3.94 


Posttranslational modification, protein turnover, chaperones 


c 


115 


4.49 


Energy production and conversion 


G 


139 


5.42 


Carbohydrate transport and metabolism 


E 


175 


6.83 


Amino acid transport and metabolism 


F 


57 


2.22 


Nucleotide transport and metabolism 


H 


97 


3.78 


Coenzyme transport and metabolism 


1 


43 


1.68 


Lipid transport and metabolism 


P 


116 


4.53 


Inorganic ion transport and metabolism 


Q 


25 


0.96 


Secondary metabolites biosynthesis, transport and catabolism 


R 


234 


9.13 


General function prediction only 


S 


197 


7.69 


Function unknown 




440 


17.16 


Not in COGs 



a f3ased on the total number of protein coding genes in the annotated genome 
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